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A GOMPERTZ FIT THAT FITS: APPLICATIONS TO CANADIAN FERTILITY PATTERNS* 


Résumé 


Dans cette étude on essaie de raffiner la méthode pour ajuster 

la fonction de Gompertz aux taux de fécondité cumulés en utilisant 

des techniques itératives, Cette méthode est expérimentée avec la 
série des données historiques de la population canadienne. On 

évalue l'implication démographique des paramétres de la fonction de 
Gompertz ajustée 4 la distribution de la fécondité et on étudie 
l'utilité de la méthode projettant les tendances futures de la 
fécondité. La fonction de Makeham est aussi ajustée a la distribution 
de fécondité par la méme technique et 1'éfficacité relative de cette 
fonction est comparée & celle de Gompertz. 


Abstract 


In this paper an attempt is made to refine the method of fitting the 
Gompertz function to the cumulative fertility rates by using iterative 
techniques, The method is tested with the historical data series for 

the Canadian population, The demographic implication of the parameters 
of the Gompertz function as fitted to the fertility distribution is 
examined, and the usefulness of the method in projecting future fertility 
trends is studied. The Makeham function is also fitted to the fertility 
distribution by the same iterative technique, and the relative efficiency 
of this function is compared with that of the Gompertz. 


* Grateful acknowledgement is made to the editor of Demography for 
permission to reproduce this paper. 
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1. Mathematical Functions in Fertility Studies 


The use of mathematical functions to describe patterns of fertility 
behavior is a common demographic practice. Many demographic practitioners 
have studied the joint effect of fertility and mortality as represented by 
the net maternity function, and several curves have been investigated as 
graduations of this function. Among the best known studies (Keyfitz, 1968, 
pp. 141-169) are those using the Gaussian or normal curve (A.J. Lotka), the 
incomplete gamma function (Wicksell), and exponential curves (Hadwiger). 

Others have attempted to find a graduation for schedules of age- 
specific fertility rates. These include Brass (1960), who used polynomial 
fittings, and Avery (1970), who compared several distributions as graduations 
of schedules of fertility rates. 

A further series of investigations has considered the Gompertz function 


as a graduation of fertility distributions. This function has the form 


Yosuk A: OE POR eee ee ee oe 


It was developed as a description of mortality curve (Miller, 1946, pp. 42-47), 
but work by Martin (1967) and Wunsch (1966) has demonstrated that this function 
might provide a useful description of fertility distributions. 

Martin fits the function by the method of selected points and Wunsch 
used a variation of this, partial totals, which in effect uses the mean value 
over selected ranges of the function. Selected point methods are not completely 
satisfactory, as will be discussed below, but despite this, both Martin and 
Wunsch achieved good fits to their data. 

Romaniuk (1969) later applied these techniques to cumulative fertility 
rates for selected cohorts of Canadian women. He also obtained some good fits 
to the data, but he was primarily concerned with extrapolating the rates of 
cohorts with partially completed fertility as part of a projection exercise. 
The Gompertz function is not well suited for this purpose (see Section 5 below), 
as Romaniuk recognized in adopting another method, i.e., the chain ratios, for 


his extrapolations. 


oe 


2, Method of Fitting that Minimizes the Sum of Squared Error 


Although the selected points method can provide good fit to the 
data, there always remains an element of uncertainty in such a procedure, 
In discussing the use of the Gompertz function and the related Makeham 
function in mortality analysis, Miller (1946) notes: 

" ,.it is not advisable to determine the numerical values 

of the constants by the use of isolated points for, -on 

account of the irregularity of the observed series, 

different sets of points produce widely varying values. 

There is no way of choosing four points beforehand with 

any assurance of obtaining a suitable representation of 

the data." 

Fitting to a cumulative curve, as did Martin, reduces the problems 
arising from selected point fittings but does not eliminate them. Our goal 
is a method that incorporates all of the information available, and which 
minimizes the standard error measure, i.e., the sum of squared differences. 
Such least squares fits are straightforward for functions in which the 
constants enter linearly, but in the present case we must make recourse to an 
iterative procedure. 

Keyfits (1968, pp. 214-215) presents a discussion of the descent 
method, which has been adopted here; we shall briefly summarize. The function 
is 


t 
YUEN SRA russ ite ee one ee ey 


Suppose Ps is the observed cumulative fertility at age i, i going from 
m to n, where m and n are the initial and final ages for which the observed 


fertility is available. 


Let ky ae and bo be an initial set of constants, and let f , (0) be 


the value of fertility at age i, estimated by substituting the initial set of 


constants in the Gompertz function. The sum of squares of deviation for the 
Oth iteration is given by 


th 


2 
oF = 2 (£, (0) - P,) eoeeoeseseereee ee eee eeeeeee es (3) 
Th 


Sn 


Now, let k = ky = hy AeA. ew een bo + h, be the next approximations 


2% 
of the constants; if Sy is the sum of squares of deviation with this improved set, 


then by expanding S in the neighborhood of the initial set of constants KF 5225» 


by Taylor's theorem, we get 


2 95 98 35 
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where 9S/ dk, represents the value of the partial derivative of S with respect to 
k, with the values of the constants being taken as ko285> and bo3 and similarly 


for the other partial derivatives. 


We will seek to minimize S$ this involves deriving the partial derivatives 


h? 


of sy with respect to k,a, and b and equating them to zero, thereby producing the 


correction factors hy sho, and h,. Thus 
2 2 2 
as OwS. Tad, ots CMs) 3 
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Writing in matrix form: 
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Or 


Thus correction factors can be found by premultiplying the vector 
of first partial derivatives by the inverse of the matrix of second partial 
derivatives, Initial values for the constants may be obtained by using the 
method of selected points, as suggested by Martin (1967). Following hin, 


we make a transformation of the origin to to thus 


Y(t) = ae “0? 
Martin shows that 
B = Qn Y(2) - &n ee 
G Y(1) - &n ao 
A = Exp fn 1G) = ts FO) ) MeN iether yah Men 7 
( pew F 
K) = Exp..€.,0n, Y(0) = fn. A) 


where Y(2), Y(1), and Y(O) are the observed values at the selected age-points 
t(2), t(1), and t(0) respectively, and r is the age-distance between two 


consecutive selected points. 
2.1 Data Fitted and the Computer Routine 


All the arithmetical operations in the programmed routine were performed 
in double precision, and because of the computational convenience, the fertility 
rate per woman was considered instead of the rate per thousand women; the input 
data for the program were (a) the cumulative fertility rates; (b) the beginning 
and the ending ages of observed fertility: and (c) the selected age-points to 
start the process of iteration. eundingien was also made in the program to 
extrapolate the fitted values up to age 49 if the observed rates were not 
available up to that point. 

Data tested (Romaniuk, unpublished) were for the cohort rates for Canada 
from 1911 to 1945 and the period rates from 1926 to 1969. Almost invariably the 
convergence was reached in three to four iterations; the reduction in the sum of 
squares of deviation was impressive, resulting in the decrease in some cases of 
as much as 90 percent. As a rule, ten iterations were performed in each case. 


Tabie 1 gives some idea as to the efficiency of the method. 


Bk 


2.2 How Good is a Good Fit? 


This decent method, then, provides a much improved fit in most cases; 
however, one must still face the question of how good, in demographic terms, 
is the fit. For example, the Gompertz curve fitted to the cumulative fertility 
rates of the 1911 cohort of Canadian women gives a sum of squared error of 
0.04183; this represents an improvement of 93 percent over the curve fitted by 
selected points, but does not completely answer the question. To obtain an 
estimate of goodness of fit in demographic terms, we apply a demographic 
technique, standardization. 

Aside from its possibilities of providing a succinct group of fertility 
parameters, a major use of the Gompertz function in fertility study might be in 
estimating the total number of children born to a cohort of women or during a 
period of time. This suggests that we:measure the error that would be introduced 
into the estimate of total births by using the fitted instead of the actual 
rates. 

As in all standardization exercises, the results permit two interpret- 
ations. One is that we are comparing the expected number of total births to a 
cohort, or during a period, with the births that would be produced if a 
population combined the given birth rates with the age composition of the 
standard population. The other interpretation is that we have chosen to weight 
the error in the fitted rates at each age by the "importance" of that age group, 
with importance being measured by the relative numbers of women in each age 
group in the standard population. 

Along with the usual interpretations of standardization we face the usual 
problem, that of choosing a standard population. We reject the notion of using 
the actual population for each cohort or period, as such shifting standards would 
make comparisons unclear. What is desired are standard populations, one for the 
cohorts and one for the periods (which will be considered later) that yield an 


overall indication of the importance of each age group over the range of data 


in question. 
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For cohorts the choice ideally would be the stationary population 
from an appropriate cohort life table. The choice is not critical here, as 
mortality patterns in the ages 14 to 49 have not changed drastically over the 
period of the data. Under conditions of stable mortality, period and cohort 
life tables coincide, so a period life table is also appropriate as a standard. 
We have chosen as standard for the cohort comparisons the latest available 
official life tables for Canadian women, i.e., those for the period 1965-67 
(Canada. Dominion Bureau of Statistics 197la). | 

Neither is the choice critical for periods, as the Canadian age structure 
has not changed drastically during the periods covered by the data. Because it 
seemed desirable to choose a pattern representing the most recent age patterns, 
the standard population for periods was taken as the age distribution of Canadian 
women in the 1966 Census as graduated from five-year age groups using Sprague's 
multipliers (Canada. Dominion Bureau of Statistics, 1971b). 

Thus armed with standardization and standard populations, we can attempt 
to answer the general question of how good, in demographic terms, are our fitted 
curves, and in particular, how good is the fit to the 1911 cohort. 

We find that using the fitted, as opposed to the actual, rates would have 
produced an error of 2.80 percent in the age-standardized total births for the 
1911 cohort. A three percent error is not overwhelming, and as we shall see most 
of the standardized errors are much lover. To give some terms of reference we 
could consider the simple approximation suggested by Keyfitz (1968, p. 350) to 
the sampling error of a standardized rate. Applying this technique to our data 
yields variances that are of the same order of magnitude as the net errors, such 
as 2,47 percent of the expected value for the 1911 cohort. 

From another perspective, we can say that given only the three parameters 
of the Gompertz function, we can estimate the age-specific fertility rates of the 
1911 cohort accurately enough so that the estimate of the standardized total 
births would be in error by only 2.80 percent. Not only is this error low; one can 
see from Table 2 that it is one of the poorer fits over the range of available 
Canadian data. 

The usual problem with cohort data arises here in that the data for many 
of the cohorts are severely truncated. Data are complete up to age 45 for the 


cohorts of 1920 and earlier. _Confining our attention to these cohorts, we see that 


Ma om 


the net error ranges from 1.62 to 2.87 percent. This small error seems an 
acceptable cost to pay for reducing the information contained in 36 numbers, 
the rates by single years of age, to three parameters. 

The present method yields fits that are best in the middle of the 
distribution; the fit in the tails is less satisfactory. Thus, the error of 
the fitted distribution is reduced as the cohort data are truncated. If we 
broaden the range of permissible data to include all those cohorts for which data 
through age 30 are available (in this case the cohorts of 1935 and earlier), we 
are still in the range that comprises 60 percent or more of the total fertility 
of the cohort. (This figure is based on the cohorts with completed fertility.) 
For these cohorts the fit is much better, ranging from 1.36 percent for the 1921 
cohort to 0.00 percent (actually 0.0001 percent) for the 1931 cohort. Table 3 
gives the complete distribution for the 1920 cohort as an example. 

The better fit of these later cohorts represents a confounding of two 
effects. The first is the truncation effect, eliminating the tail of the 
distribution, where the curve doesn't fit so well anyhow; the second is the fact 
that the fit is intrinsically better with younger cohorts. This will be discussed 
in some detail below, but in brief it stems from the fact that as one moves from 
earlier to more recent cohorts, the fertility patterns of Canadian women move 
into a range in which the Gompertz function fits more efficiently. 

We have been discussing the net error of these Gompertz fits. Table l 
also presents the gross error. This is the sum of the absolute value of the 


differences at each age. The net and gross error are defined as 


Net error = al i a 


Gross error 


It 
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where ry and ri are the actual and estimated rates by age, respectively, and 


k is the standard population by age. The gross errors range, for cohorts of 


1935 and before, between 2.5 and 14 percent. Which measure of error to consider 
as appropriate depends upon the intended use of the fitted curve. For those 


studies which refer to the entire cohort, such as parameterizing the fertility 


ay ts 


experience or graduating the data, the net error seems the appropriate measure. 
In exercises which require the use of parts of the distribution in conjunction 
with other cohorts, such as translating the data for a number of cohorts into 
period data, the gross error would be appropriate. 

Given these criteria, the fitted curves would be very good for the first 
type of exercise, but poorer for the second. Thus the Gompertz curve would not 
be well suited to what has now become almost the standard fertility projection 
technique - making estimates for cohorts and translating the results into period 
rates. 

It should also be noted that the gross error measure assumes complete 
accuracy in the rates by single years of age, which is a questionable assumption. 
It may be that the graduated rates themselves are a better description of the 


underlying fertility process. 


2,3 Fitting to Period Data 


Analytical functions such as the Gompertz reflect a series of assumptions 
about the orderly workings of an underlying process. Schedules of real cohort 
rates represent the passage of a group of women through the ages of fertility, and 
it seems likely that they would have followed the kind of process underlying the 
analytical distribution. Thus cohort rates have seemed likely candidates for 
yielding close fits; they have seemed to be better candidates than period rates, 
‘which represent the experience of different groups of women. 

However, to the extent that a given cohort of women is subjected to 
period phenomena which influence fertility - wars, door a eeene and such ~ they are 
less likely to meet the conditions ecescaryittar a close fit to the function. 
Martin (1967) makes this point in discussing his choice of the 1897 cohort of U.S. 
women for testing his- method. 

Schedules of period fertility rates also contain a fundamental order that 
is imposed by the strong physiological and social forces which arise from aging. 
Mortality analysis is largely founded upon this type of orderliness, but it has 
often received little emphasis in fertility analysis. In addition, period fertility 
rates, by definition, hold constant those period phenomena that disturb the cohort 
rates, Thus one might expect that period rates would fit our analytical function 
sbout as well as do cohort rates. Period rates also are free of the truncation 


problems which arise with cohort rates. 
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Table 4 presents the results of fitting the Gompertz function to 
period data for Canadian women from 1926 to 1969. (As was discussed above, 
the graduated age distribution for Canadian women, 1966, was chosen as the 
standard population.) 

As the table indicates, the fits are at least comparable to those 
obtained for the cohorts. The net error ranges from 0.72 to 1.87 percent, 
gross error from 7.00 to 12.02 percent, and the sum of squared error from 
0.0053 to 0,0417. 

As with the cohort data, the fit improves as we move from the earlier 
to the more recent data. For the periods of the 1960's, the net error in 
estimated total births for the period is, save one exception, less than one 
percent. Since there are no truncation problems with period data, this steady 
improvement of the fit as one moves from earlier to recent periods is due largely 
to the fact that the more recent fertility patterns lie more comfortably in the 
usable range of the Gompertz function, as will be discussed in the next section, 
It should be noted, however, that the data themselves are smoother for the later 
periods, probably reflecting improved reporting. 

For period distributions it is probably the net error that is the best 
measure of the efficiency of the fit. The principal uses of the fitted curves 
would be in projection, graduation, and parameterization, areas in which the 


net error seems most appropriate. 
3. Demographic Interpretation of the Parameters 


The ability of the Gompertz function to reproduce accurately, from three 
parameters, the age-specific fertility rates for these Canadian data suggests that 
we have before us a useable means of parameterizing the fertility experience. 

The next step in this process is to provide a demographic interpretation of the 
three parameters of the Gompertz function. We shall see that one of the 
advantages of this function is that a demographic interpretation is relatively 
straightforward. | 

As mentioned above, it is desirable to make a shift of origin on the age 
axis; for the iteration process used here, we have taken 24 as the origin. Thus we 


have taken 24 as the origin. Thus we have for the Gompertz function 


~|6— 


plt-24) 
Y = KA eeeeveeveeeeeveeeteeeeeeeeeeeeeeoee ee ee @ (10) 
where Y is the cumulative fertility rate by age, and t is age; what, to the 
demographer, are K, A, and B? 
In the range that interests us, B will be less than one; thus as t 


: ee t-24 
increases without limit, 3S ) 


approaches zero and Y approaches K. Thus K 
is the assymptote, the upper limit of Y, and in general can be interpreted as 
the total fertility rate of the cohort or period. 


At the origin, X = 24, we have 


Y = KA" = KA. 


Thus A is the proportion of the total fertility rate that has been attained by 
age 24, a common demographic measure often used to indicate how much of child- 
bearing is concentrated in the early years. 

The remaining parameter B is not completely straightforward, but as is 
apparent from the curves of Graph 1, B is related to the degree to which age- 
specific fertility is concentrated about the peak age of fertility, to the moments 
of the fertility distribution. 

The curves plotted in Graph 1 are the first derivatives of the Gompertz 


function 


3 eX,) p kX ) 
Z(x) = dY/dX = KA QnA Sh ee eS ee Gk 


When the Gompertz curve is interpreted as the cumulative fertility rates, the 
derivative represents the age-specific rate at exact age X. As Graph 1 demonstrates, 
these curves flatten out as B is increased while holding A and K constants. As B 
increases, the rates in the middle of the distribution decrease, while those at 
the tails increase. Thus the curve is very peaked for B = 0.7, very flat for 
B= 0.9, . 

To give some idea of the ranges and magnitudes involved we use some of 
the properties of the Gompertz function discussed by A.L. Titus in Appendix A. 
He shows that the curve of age-specific rates implied by the Gompertz fit must 


have the shape shown in Graph 1, and that a unique maximum exists, say at age x 


5 fr 


For present purposes, we take the origin at x and investigate the 
partial derivative of (15) with respect to B (to simplify the notation we 
write t =X-xX). 

m 
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Collecting terms and using H &n G = &n a gives 


t 


B ali Be 
dZ/9B = (KA 4/B) Sn A E +t &n B (’nA 


= Dl reacrad secs 12) 
Since the sign of 0Z/dB will depend only upon the term in brakets, we let 
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We make use of another of the results shown by Titus (see Appendix A); he shows 


that within the range of constants appropriate here, the constant A must equal 


-l1 : : & : s . 2 
e whenever the origin is fixed at the age of maximum fertility. Since we 


: : : -l 
have chosen the origin here as at this maximum age, we have A = e' and 


CR te haee bake OME AC Reta) in edad Oe oa cealy ues ode a tel 109 


We see at once that at the origin, the age of maximum fertility, 
F(B,t) - 1, and 


Bo Be 
OZfap ="(RA C/BY*kn7A < 0 


t 
since &n ia” is always negative and the other terms positive. Thus at the origin, 
when B increases Z (t) decreases; in fact this will be the case whenever F(B,t) 
is positive. 


Recalling that 0 < B < 1, we see that 
ECR) <b for ail its 
and furthermore that as 


Ce (Bit -8 
t >-~, F(B,t) + = 
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and that F(B,t) is monotonically decreasing with t to the right of the origin, 
monotonically decreasing to the left. Thus, F(B,t) = 0 and 92Z/93B = 0 at two 
and only two values of t, once for t positive, once for t negative. These two 
roots, say ty and t, define a band of the Z(t) curve. This band is roughly 
centered about the age of maximum fertility as the origin. Within this central 
band of t values, Z (t) decreases as B increases, i.e., dZ/dB is negative; for 
t to the left and right of this band, Z(t) increases as B increases. Thus as 

B increases, the curve is flattened out and the variance of the curve is 
increased, 

The three parameters of the Gompertz function, then, have clear 
demographic interpretations: 

K: the total fertility rate 

A: the proportion of total fertility completed at the origin, 

here at =. = 24 

B: an indication of the variance of the distribution, of the spread 

of fertility over therace “span. 

We are now in a position to undertake for purposes of illustration a 
brief discussion of the changes in Canadian fertility patterns over the range of 
our data. But first we must discuss two properties of the Gompertz function which 
affect its use in fertility analysis. 

The first of these is discussed by A.L. Titus in Appendix A. He discusses 
‘the first derivative of the function, which can be interpreted as the age- 
specific fertility rate at any instant. It shows that these rates increase 
monotonically to a maximum, and decrease monotonically after that. This seems 
reasonable, more so than his other point that at the age of maximum fertility, 
exactly a or about 3/7 percent of the total fertility is completed for all sets 
of parameters within our range of interest. This is a curious property, but even 
more curious is the fact that, judging by the good fit of this function to the 
data, it is also a property of Canadian fertility patterns. 

A second important property is apparent in Graph 1. The curves in this 
graph have been normalized to an upper assymptote of zero. We see that for 
relatively high values of B, the parameter controlling the spread of the distribution, 
the function does not approach the assymptote within the age range of human fertility. 
Table 5 presents some numerical values. A glance ahead to Tables 6 and 7 will . 
demonstrate the importance of these figures. For most of the data presented, B 


falls within the range from 0.80 to 0.86. 
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GRAPH 1 


DISTRIBUTION OF AGE-SPECIFIC FERTILITY RATES FOR DIFFERENT 
VALUES OF THE CONSTANT B (OF THE CUMULATIVE CURVE) 
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The fact that B decreases as one moves from earlier to more recent 
groups of women helps to explain why the fit of the curve is better for these 
more recent groups: the fit is better because the function is better behaved 
in this range. The behavior of the function for this range of B also helps 
to explain why attempts to achieve a better fit to the upper end of the 
distribution by fixing K at the total fertility rate and iterating only on 


the other two constants have not been successful. 


4. The Changing Pattern of Canadian Fertility 


To illustrate the use of this form of parameterization, we assume that 
all of the fits are equally tolerable and observe the trends in the constants. 
Table 6 presents the constants for the series of cohorts, Table 7 for the series 
of periods. 

The most striking aspect of these series is that they are very orderly. 
For the cohorts, K, the equivalent of the total fertility rate, increases 
steadily from 2.89 for the 1911 cohort, peaks at 3.64 for the 1929 cohort, and 
declines steadily after that. The two parameters that summarize the timing of 
fertility A and B also show an orderly pattern of change. A, the proportion of 
fertility completed at age 24, remains roughly constant at about 24 percent from 
the 1911 to the 1915 cohort and increases steadily after that. B, the measure 
of the age spread of fertility, declines for the first five cohorts, reaches a 
plateau at about 0.847 through the 1919 cohort, and declines steadily after that. 

Experiments indicate that truncated data sets tend to underestimate B 
and overestimate A, so one must exercise caution in interpreting the parameters 
for the more recent cohorts; however, one can state with some assurance that the 
pattern of cohort fertility has shown a trend toward births being concentrated 
in the younger ages along with a reduction in the variance of the birth rates by 
age, a squeezing together of the fertility distribution. 

For the period data, there is no truncation problem, and one can use the 
entire series with some confidence. Here we see the same orderly movement of 
the parameters (Table 7). We find that K, the total fertility rate indicator, 
follows the well-known pattern: a decline from 1926 through the depression years 
of the 1930's, a moderate recovery during the war years, a strong recovery after 


1945, peaking in the years from 1957 through 1959, and a very strong decline after 
that, 


ey oe 


The two parameters of timing present a pattern of striking consistency. 
With one exception both series are nearly monotonic. The proportion of fertility 
completed by age 24, A, increases steadily from 23 percent in 1926 to 39 percent 
in 1969, The parameter measuring the spread of the fertility distribution, B, 
shows a steady decline, indicating a consistent tendency toward a squeezing 
together of the fertility rates by age. The exception to this steady change is 
a plateau in both series during the depression years of the 1930's; during this 
period A fluctuated around 23.5 percent and B around 0.861. 

From this perspective, the much discussed boom—-bust-war and boom cycle 
of the period between 1925 and 1960 seems to have had its principal disruptive effect 
on the total fertility rate of the periods involved. The timing parameters 
throughout the entire period were undergoing a very steady evolution toward 
younger and more concentrated fertility patterns. The depression halted this 
evolution of the timing patterns, but the process continued unabated through the 
war, the postwar baby boom, and the sharp decline of fertility in the 1960's. 

The reader is referred to the error terms of Tables 3 and 4 for a reminder 
that these parameters do seem to provide a fairly accurate summary of the fertility 


experience. 


5; Possible Use in Projections 


A function that predicts total births accurately from a few parameters 
surely has implications for fertility projections. The fertility assumptions could 
be framed in terms of the parameters in the light of historical patterns, and the 
possibility is opened of linking these parameters to other data series such as 
economic projections. 

The Gompertz function does appear promising for at least one strategy of 
fertility projection, which will be the subject of a later communication. However, 
it should be noted that the Gompertz function does not seem to appear well suited 
for use with the cohort-translation method of estimating fertility. This method 
makes its assumptions on fertility in terms of birth cohorts of women, and then 
translates schedules of cohort rates into period rates. It presents several 
difficulties (as do all methods), and the Gompertz function may not help in 
solving them. 

The major difficulty with the cohort-translation method lies in translating 
the projected cohort rates into period rates. Usually the direct translation 
results in a series of period rates that may be unacceptable in the light of 
historical series. Ad hoc devices are usually employed to batter the period rates 


into shape, and the use of the Gompertz function may not offer immediate improvement 
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over these devices. Indeed, the high gross error of the estimates relative 
to the net error seems to indicate that caution should be exercised in the use 
of the Gompertz function for translation. However, further research may enable 
one to link explicitly the cohort parameters to those for the period, which will 
permit one to estimate the period parameters directly from those for cohorts. 
Another possible use of a function such as the Gompertz might be to 
extrapolate the experience of cohorts of women with partially completed fertility. 
Again, this places us in an area where the function does not perform well. 
Experiments (to be reported in detail in the later communication on projection) 
indicate that truncated data sets do not yield accurate predictions of the final 
parameters; and even if they did, the function does not fit well at the tail end 
of the distribution. Tabie 8 indicates that even for those distributions which 
yield a very good fit to the data, the fit in the upper tail is relatively poor. 
In the next section, we discuss a related curve that does a better job in this 


respect at the cost of adding another parameter. 
6. The Makeham Function 


Although one hates to force another parameter into a process that already 
yields a good fit, completeness of discussion plus the relatively poor fit of the 
Gompertz function at the upper tail of the distribution suggests an investigation 
of a curve closely related to the Gompertz function, i.e., the Makeham curve. 


The form of the Makeham curve is 


Ye KS AD soc dele ee ene ee ae, TT 


This curve arises when one postulates a constant force of mortality (or fertility) 
operating along with that which varies with the size of the existing rate at a 
given age (Forsythe, 1924, pp. 67-68). 

The procedure for fitting this curve by descent methods is essentially 
the same as for the Gompertz, except that there are now four parameters instead 
of three (see Section 2 above). Tables 9 and 10 summarize the results of fitting 
the Makeham function to selected Canadian data. The fit is indeed improved, as these 
tables indicate, Despite this improvement, we would not recommend the use of the 
Makeham over the Gompertz function, except perhaps for extrapolation of partially 
completed fertility. 
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One of our main goals is a succinct set of parameters, and moving from 
three to four is a real loss in compactness. Furthermore, the addition of the 
term s* destroys the rather straightforward demographic interpretation of the 
parameters, Finally, except for the case of S = 1, when we are back to the 
Gompertz, the Makeham curve does not approach K symptotically, but increases 
without limit when S is greater than 1 or approaches zero when, as with the 
present data, S is less than one. For some of the early Canadian data the 
Makeham fit turned down before age 49, yielding cumulative rates that decline 


and implying negative age-specific rates. 
io conctusiom 


As with most exploratory studies, this one has raised more questions 
than it has answered. We have shown that the Gompertz function, as fitted by 
the proposed method, yields good fits to a broad range of Canadian fertility 
patterns, and that these fits are close enough to suggest the three parameters 
of the Gompertz function, parameters which have clear demographic interpretations 
and are likely candidates for a parameterization of Canadian fertility patterns. 
The next steps, currently underway, are a detailed investigation into 
the use of the proposed fitting method in Canadian projection work; the use of the 
function in the problem of translation of cohort into period fertility phenomena; 
a study of its range of application beyond Canadian data; and, if it does prove to have 


a fairly broad range of application, its use as a graduating function in analytical 


work. 
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APPENDIX 


Some Properties of the Gompertz Function 
by 


AGL batus 


Some implications of fitting the Gompertz function to rates of 
cumulative fertility by age are revealed by examining the derivatives of the 


function. Given the function: 


Y = KA RTeraM ee aiortics sacar dieohaecde sien eats as ae Gaeta chains aie ea LD 


where K, A, B, and X, apecGonustanto with OL“ Asc.) < BU 1 rand & 2.0. then 


X-X 
EE eee Oar ee ee ee ne ec: 


and 


x-X 
devas 0 oe (et Ae oes) Ch aes homcanenin) aa oddiaeA a. 3) 


If X is age, and Y is the cumulative fertility rate by age, then K is equivalent 


to the total fertility rate, and A is the fraction of K completed by age Xx? 


moreover, the first derivative of Y with respect to X, (equation 2), is the 
birth rate at exact age X, and the derivative of this function, i.e. the second 
derivative of the Gompertz function (equation 3), is the slope of the curve of 


instantaneous age-specific fertility rates. 


By definition of the constants the product of the first two sets of 
terms in brackets in (3) is always negative. Thus the slope of (2) is determined 


by the sign of 


Be X, $n A +1) escet®eeeeeereeeeeeeveoeeveevn eevee eeevevee eevee ee ee se @ (4) 
Since 0 < A < 1 then &n A is negative. When in equation (4) the term 


Tare Xn A is less than -l, the expression will be negative and the slope of 


equation (2) will be positive. 
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Similarly, when 
X-X ; ; 
Bo &£n A > -1, the slope of (2) will be negative 
and when 
X-X ‘ ; 
B o $n A = -1l, (2) will be at a maximun. 
We note that at this maximum point, the age of the maximum age-specific 
fertility rate in the present interpretation 


X-X 
Qn a keen 


or 
roe 2) Sarin act 
Thus fitting the Gompertz function to cumulative fertility rates implies 
that the curve of instantaneous age-specific rates will have the following 
properties: 
(a) it will increase monotonically to a maximum and decrease 


monotonically after that, 


(b) at the age of maximum fertility, exactly a about 37 per cent of 
the total fertility will have been completed, no matter what the 


values of K, A, B, and Xo 


The first property is as expected; the second is a surprising peculiarity. 
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TABLE 1. Distribution of Experimented Sets of Cohort and Period Fertility 
Rates by the Percent Reduction in the Sum of Squares 
of Deviation Due to Iteration 


Percent Cohorts * Period * 


Reduction in 7 
Sum of Square Frequency Per cent Frequency 
of Deviation 

2 = 


90 + 

i> to 90 
50° to: 75 
40 to 50 


Less than 40 


* Cohorts tested were for Canada for the years 1911-1946 and periods were 
for 1926 to; 1969; 


aa 


TABLE 2. The Final Sums of Squares, Net Error and Gross Error 
for Cohort Rates for Canada, 1911-1945 


Final Net Gross 
Year SS Error Error 
1911 0.041826 2.50 13.26 
1912 . 0.041708 2.79 13%25 
1913 0.041133 2.87 12407 
1914 0.033225 2.49 11.09 
1915 0.029024 | 2.96 10.48 
1916 0.025934 2.28 9.97 
1917 0.022754 Zeek 9.50 
1918 0.017063 704 9.40 
1919 0.016748 1.89 9.05 
1920 0.011885 E66 oie 3 0S: 
1921 0.008389 Beso 7.49 
1922 0.005827 SFG Be 6.87 
1923 0.005965 0.76 6.46 
1924 0.006116 0.54 5.0m 
1925 0.006140 0233 5.035 
1926 0.005045 Oakey x.L9 
1927 0.003760 : 0.14 4.40 
1928 0.002468 0.16 4.07 
1929 0.002237 0.11 S768 
1930 0.002323 0.09 at A 
1931 0.001686 0.00 350 
1932 0.001209 0.02 Zale 
1933 0.001160 0.18 Su ouk 
1934 0.000730 O21 2.88 
1935 0.009403 0.15 2.42 
1936 0.000270 0.14 2.06 
1937 0.000135 0.08 LwS2 
1938 0.000126 0,20 1,382 
1939 0.000096 OF23 A RAY f: 
1940 0.000034 Osh 1.20 
1941 0.000029 O.11 y ee SS 
1942 0.000012 O<07 0.99 
1943 0.000010 0.09 wl 0S 
1944 0.000004 0.01 1220 


1945 0.000001 0.05 0.96 


ae 


TABLE 3. Actual and Fitted Values (Cumulative) 
for the Canadian Cohort of 1920 


SSS 


Percent 

Age Actual Estimated Diff. 

14 0.0006 0.0027 - 377.5440 
LS GO. 0052 0.0080 - 160.3258 
16 0.0105 0.0203 ae 926) 902 
7 0.0307 6.0446 - 45,2292 
18 0.0724 0.0869 - 20,1617 
19 0.1455 0.1529 - 5.0923 
20 0.2508 0.2466 1.6670 
Palt 0.3879 0, 3696 4,7096 
Z2 0.5447 0.5207 4.4069 
23 0.7090 0.6959 1.8437 
24 0.8835 0.8898 - 0.7060 
25 1.0889 1.0956 - 0.6082 
26 f. 3538 1.3066 0.5475 
27 ILO 2 Le0406 - 0.0368 
28 Ped¥53 Laie GS - 0.3284 
29 1.9002 1.9152 ~ 0.7919 
30 2.0784 2.0968 ~ 0.8819 
31 2e2401 2.2639 ~ 1.0603 
az 2.3996 Cate, = 0.6728 
33 24935 Yh ns WAP ~ 0.2740 
34 2.6739 2.6739 0.0002 
3D 2.1892 2.7814 0.2798 
36 2.8947 DEE pots) 0.6554 
i Ts 2.9837 2.9582 0.8539 
38 3.0621 3.0298 1.0536 
39 321245 3.0919 1.0457 
40 3.1739 3.1454 0.9000 
41 3.2084 3.1914 0.5312 
42 3e2359 aot BUD 248916 
43 342505 3.2647 - 0.4377 
44 3.2596 3.2937 - 1.0434 
45 3.2641 3.3184 ~ 1.6627 
46 3.3394 

47 365510 

48 mR ws S 

49 33600 


The constants are A = 3.4581, B = 0.8467 and C = 0.2573 
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TABLE 4. The Final Sums of Squares, Net Error and Gross Error 
for Period Rates for Canada, 1926-1969 


Final Net Gross 
Year Ss Error Error 
1926 0.041696 1.86 12302 
1927 0.042620 Pood 122799 
1928 0.040895 P85 13.09 
1929 0.034042 Eee 11.80 
1930 0.034385 1.69 11.88 
1931 0.028241 L265 10.18 
1932 0.027092 190 13,16 
1933 0.027170 1.80 EEZ52 
1934 0.028290 1.88 aE gl 5 
1935 O,02Z7734 1,86 11.48 
1936 0.027788 1.89 1Liss2 
1937 0.023837 1200 LP 522 
1938 0.024061 sR eS 11.94 
1939 0.022343 Po67 LPs 67 
1940 0.021919 1.54 11.40 
1941 0.019407 1.40 10.41 
1942 0.020463 2 ey: 10.72 
1943 0.020810 1.44 10.40 
1944 0.023691 1353 10357 
1945 0.026097 ets 10.81 
1946 0.027418 SRS 10327 
1947 0.030079 123 10.44 
1948 0.026075 1,23 10.10 
1949 0.023668 128 9.46 
1950 0.022809 1,26 9,28 
1951 0.024752 P25 9.76 
1952 0.028168 Poeg 10,20 
1953 0.031159 5 ee! 10.68 
1954 0,033256 SS ke 10, 86 
1955 0.032883 bie 10.51 
1956 0.029107 1.04 9.57 
1957 0.031558 1,0f 9.72 
1958 0.029678 1.00 9,84 
1959 0.028585 0.95 9,62 
1960 0.037584 1.08 13520 
1961 0.028168 0.89 9,64 
1962 0.026963 0,88 9.80 
1963 0.025980 0,89 9,95 
1964 0.021279 0,89 S023 
1965 0.016754 0.91 Oo74 
1966 0.012176 0.83 8.95 
1967 0.009615 Ost 8.55 
1968 0.005409 O,72 7.00 


1969 0.005259 0.73 7.00 


enti: 


TABLE 5. Percent of Total Fertility (K) Reached by 
Ages 45 and 49 for Different Values of B 


(A = 0.272) 

B aeetey Ace 45 Age 49 
.800 98.81 99.51 
825 97.73 98.94 
.850 95.80 97.79 


res di. 92.42 95.48 
- 900 86.72 91.08 
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TABLE 6. The Parameters K, A, and B for Cohort 
Distributions for Canada, 1911-1945 


2.8855 0.8593 
29502 0.8569 
3.0343 0.8541 
3.0600 0.8501 
302733 0.8485 
3.0101 0.8471 
3.0733 0.8476 
3.0434 0.8473 
3.4128 0.8478 
3.4581 0.8467 
3.4828 0.8456 
3.4134 0.8438 
3.4565 0.8423 
3.4945 0.8411 
3,4815 0.8400 
3.4680 0.8390 
35216 0.8367 
3.4974 0.8333 
3.6449 OcBSLt 
3.6185 0.8268 
3.5823 0-829 2 
3.4656 0.8171 
3.3476 058135 
Ae2Ore 0.8073 
3.2146 0.8018 
3.0316 0.7943 
2.9435 0.7870 
2.8016 0.7797 
2.6095 0.7726 
2.4203 0.7645 
2.1883 0.7590 
1.9797 057523 
1.6690 0.7442 
1.4417 0.7436 
1.3149 0.7397 
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TABLE 7. The Parameters of K, A and B of Period 
Distributions for Canada, 1926-1969 


Year K B A 

1926 3.9102 0.8641 0.2306 
1927 Sense 5 0,8646 0.2325 
1928 3.5040 0.8642 0.2359 
1929 3.4062 0.8623 0.2435 
1930 3.4699 0.8614 0,2443 
1931 323765 0.8597 0.2433 
1932 352039 0.8610 0.2384 
1933 vou oidas, 0.8617 2.2355 
1934 2.9799 0.8622 0.2293 
1935 2.9228 0.8617 0.2334 
1936 2.8602 0.8614 0.2349 
1937 2.7969 0.8601 0.2422 
1938 2.8467 0.8592 2523 
1939 7 apa he of 0.8584 0.2560 
1940 2.8891 0.8557 0.2697 
1941 2.9420 0.8527 0.2811 
1942 3.0765 0.8520 0.2816 
1943 3.1550 0.8505 O.z2irz 
1944 3o1347 0.8528 0.2688 
1945 3. 1439 0.8541 0.2686 
1946 3.4845 0.8493 0.2865 
1947 3,6967 0.8466 0.3046 
1948 3.5406 0.8474 0.3083 
1949 3.5546 0.8465 053112 
1950 3.5499 0.8461 0.3131 
1951 3.5959 0.8460 0.3200 
1952 3.7368 0.8450 0.3262 
1953 3.8185 0.8446 0.3300 
1954 359289 0.8446 0g. 3339 
1955 3.9290 0.8437 0.3360 
1956 3.9553 0.8420 0.3417 
1957 4.0346 0.8420 0.3504 
1958 3.9849 0.8407 0.3541 
1959 4,0441 0.8396 0. 3586 
1960 4.0138 0.8393 0. 3609 
1961 3.9460 0.8379 0.3649 
1962 3.8566 0.8363 0.3655 
1963 361/419 0.8359 0.3650 
1964 3.6000 0.8361 0.3615 
1965 S Peso bop] 0; 8372 0.3645 
1966 2.8865 0.8362 oat Pee 2 
1967 2.6434 0.8333 0.3876 
1968 2.4877 0.8294 0.3945 


1969 . 2.4521 0.8294 0.3945 
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TABLE 8. The Percent Difference in the Upper Tail for 
Selected Cohort and Period Distributions 


Cohort 1920 30 


Period 1969 30 


Actual 


0.1783 
0.0494 
0.0044 


0.1089 
0.0239 
0.0031 


Percent 
Estimated Diff 
0.1815 21684 
0-0535 8,31 
0.00247 457.35 
0.1094 0.47 
0.0223 6.49 


0.0091 191.02 
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